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1. INTRODUCTION 

Numerous studies and research have been devoted to increasing the reliability of diagnosing and 
evaluating the technical condition of power transformers. Increasing the reliability (reducing the error) of the 
equipment condition assessment is one of the key tasks of technical diagnostics, regardless of the scope of 
application, the methods, and the diagnostic tools used. The urgency of the problem is associated with the 
severity of the consequences (costs, damage) that arise as a result of errors in the diagnosis, based on which 
untimely and unjustified decisions are made to withdraw equipment for repair or refuse to repair. Numerous 
studies have been devoted to solving this problem, the results of which are particularly reflected in the 
following publications [1]-[5]. 

The reliability of diagnostics is usually understood as a numerical characteristic of the 
correspondence of the diagnostic results to the actual technical condition of the object [3]. It is customary to 
distinguish between instrumental and methodical reliability of diagnosis [1]. Instrumental reliability is 
determined by the composition and stability of the object's diagnostic parameters, the specified tolerances, as 
well as the accuracy, sensitivity, and condition of the measuring instruments [4], [5]. Methodical reliability, 
as a rule, is associated with the processing of measurement results, the choice of diagnostic features, and 
criteria for assessing the technical condition of the equipment [6]-[9]. 
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One of the promising directions for improving the methodical reliability of diagnosing oil-filled 
power transformers using the results of various control methods is the use of statistical solutions based on the 
processing of multi-parameter measurement data [10]-[14]. Methods for transformers diagnosing have 
different frequencies of application, sensitivity to the occurrence of malfunctions, and, as a result, different 
information content in terms of statistical estimates. The most informative methods include methods for early 
detection of developing defects in transformer equipment, such as analysis of dissolved gases in oil (DGA), 
vibration diagnostics, and thermal diagnostics. These methods allow generating a representative sample of 
data for a relatively short period of operation of the equipment, which is a prerequisite for the application of 
statistical classification. 

The Bayes method successfully solves the problems of statistical classification and pattern 
recognition [15], [16]. The method allows you to adapt the probabilities of the outcomes of random events to 
the newly emerging a priori information [17], [18]. However, the wide use of the Bayes method for the 
development of effective practical applications in the diagnosis of transformers is hindered by the 
multimodality and multidimensionality of statistical distributions of controlled parameters, as well as the 
nonlinear separability of classes of states [19], [20]. Overcoming these limitations requires a special approach 
and is an actual task. The article is devoted to the development of a statistical approach in the direction of 
using the Bayesian classifier as a regular effective tool for improving the methodological reliability of 
recognizing defects in oil-filled transformer equipment based on the results of DGA. The results of the 
conducted studies touch upon and discuss aspects of the diagnostic value and stochastic nature of the 
obtained solutions. 


2. THE MAIN THEORETICAL PROVISIONS 

The statistical approach to the problems of technical diagnostics of electrical equipment (EE) is 
based on the presence of a representative sample of experimental data from a certain general totality, which 
corresponds to a certain distribution law with statistical moments of this distribution. This position made it 
possible to apply the known methods of statistical analysis to the solution of many fundamentally important 
diagnostic problems, for example, as the formation of a reliable image of defects, determination of admissible 
and maximum permissible values of controlled parameters, identification, and formalization of practically 
significant statistical dependencies [14]-[16]. 

As a rule, in the operation of EE, the formation of samples of experimental data is preceded by the 
determination of a set of informative controlled parameters (signs of defects), which will play the role of 
random variables (RV). The dimension of the initial feature vector X = {x1,X2, ...,X,} is the parameter on 
which the reliability of the obtained diagnostic evaluations depends critically. The fact is that each RV X - 
vector component often has its own statistical distribution with its numerical characteristics, which 
significantly complicates the integral assessment of the feature vector for the formation and separation of 
classes of states EE. Reduction of feature space dimensional simplifies transformations and facilitates the 
solution of the statistical classification problem. To reduce the feature space, methods based on the exclusion 
of dependent and insignificant components are applicable (factor analysis method, principal component 
analysis method) which, however, do not eliminate the loss of useful diagnostic information [17], [18]. 

One of the methods using the reduction of the initial space of the controlled features through a 
special transformation in the form of a nonlinear function of the primary diagnostic parameters (1) was 
proposed in [12]. The method used for the DGA of power transformers (PT) introduces a generalized feature 
D that converts a multidimensional space X of parameters (concentrations of diagnostic gases 


(Ai, ppm, i= 1.7)) into a one-dimensional RV with changes on the numerical axis in the interval 0 + 00): 


el Gy 
Da, Gan) 


i=1 i=1 


Aimax (1) 
here Aimax,ppm - preset limits for the concentration of diagnostic gases. 

An adequate replacement of the random vector of gas concentrations {A;} by a scalar discrete RV 
Dallows moving from a multidimensional problem to the study of the properties of a one-dimensional 
random distribution. In addition, on the positive axis D € 0 + œ) the dichotomy of the PT state classes is 
distinguished: 


Class S; state "normal"; 
Class S, state "deviation from the normal (2) 
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The decisive rule that establishes an unambiguous correspondence between the presence of a 
developing defect in the PT, the value of a generalized diagnostic feature D, and a set of classes of the 
technical condition of the equipment can be formed only after determining the boundary of the dichotomy of 
classes (2). Under conditions of operation of a group of similar PT, a random implementation is obtained 
based on a single DGA protocol. Taking into account the composition of the PT group and the duration of 
their operation period (on average 5 years) a representative sample of RV can be formed, which is subjected 
to statistical analysis to verify the distribution law and calculate the statistical moments in each of the classes 
of states. To perform the initial differentiation of the dichotomy of state classes, the criterion of "boundary 
concentrations" [19] is used, according to which: 


Ai S Aimax E S1; Ai > Aimax E S2 (3) 


due to the possibility of starting classification according to criterion (3), two training sets of RV D for the 

selected dichotomy can be formed. 

A statistical analysis of distributions D for each of the classes of states is carried out with the 
determination of their numerical and integral characteristics, as well as with the testing of the hypothesis of 
belonging to a certain distribution law. Numerous studies of DGA statistics on different control groups PT 
110-220 kV [20], [21] allowed us to identify and justify several characteristic features of the distribution of 
RV D: 

a. In most practical cases, the statistical distributions of RV D in the state classes S4, S2 and are mixtures of 
several homogeneous distributions. If it is possible to separate them, additional diagnostic information 
appears, which is valuable for substantiating decision-making rules for the further operation of the PT. 

b. The width of the range of change of RV D in the class S, is due to: 

— The difference in the service life of the PT of the control group: the aging of structural elements 
gradually increases the concentration of characteristic gases and, as a consequence, the value D; 

— Periodic corrective actions for long-term operating PT: corrective action with oil degassing reduces 
gas concentrations, with them and the values D, making them comparable with the values 
characteristic of new PT. 

c. The width of the range of RV D changes in class Sz is primarily due to the varying degrees of criticality 
(stage of development) of defects detected in PT; 

d. As a rule, the RV D distributions in each of the classes are two-parameter and obey one of the laws: 
normal, log-normal, gamma, which opens up possibilities for applying the significant advantages of the 
Bayesian classifier when forming the dichotomy interface of classes state PT [20]. One of the invaluable 
diagnostic evaluations of the merits of the statistical Bayesian classifier based on the likelihood ratio is 
the possibility of minimizing the total error of defect recognition in the EE [21], [22]. Moreover, along 
with an assessment of the belonging of the current state of EE to one of the distinguished classes of states, 
the probability of this assessment can also be determined. 

The Bayesian classifier, formed for a given dichotomy of classes S, and S}, satisfying all these 
requirements, is represented by the expression (4): 


P(S,) 
P(S2) 


In[p(D/S2)] — In[p(D/S,)] = In | (4) 


here: p(D /S;) - conditional probability densityD (j = 1,2); P(S;) a priori probabilities of the state of the PT 
P(S1) 
P(S2) 
normal or close to it, expression (4) is transformed into a quadratic form with a strict analytical solution (5): 


belonging to the jth class; - likelihood ratio. For a random variable D, distributed according to the law 


_ (Mı: of — M, - of + VR) i 
Dmax = (02 — o? (5) 


where: Dmax - mathematical model of the interface between the classes of states of PT; VR - a function of the 
numerical characteristics of a random attribute in each j class of transformer states (Mj-mathematical 
expectations; o;-standard deviation). In the case of a normal distribution of RV D the approximate 
mathematical model (6) can be contrasted to the complete interface model of the dichotomy of state 
classes (5): 


Dmax * Mı +k-o, (6) 
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which satisfying the “3-sigma rule” for the normal statistical distribution D in the class of states S4. Studies 
have established a fairly good agreement between the results of calculating Dmax using the exact (5) and 
approximate (6) models. In addition, model (6) allows you to adjust the Dmax value by selecting the 
computational constant k = 2 + 3 according to the criterion min[e(k)], where e(k) - the estimate of the total 
error of defect-recognition in PT, including error estimates of the first and second kind: €, - “false anxiety” 
and £2- “defect skipping”. Based on the foregoing, we can formulate the following decision-making rules for 
recognizing PT operational status classes: 


D < Dmax State class S,; D > Dmax State class S, (7) 


3. CALCULATION RESULTS, ANALYSIS AND DISCUSSION 

In the computational part of the study, the situation with one of the block transformer TPS (TDN- 
250000/220 kV) of 1992 is considered. In August 2006, according to diagnostic data, a developing thermal 
defect in the high-temperature range 0 > 700 °C was detected in the transformer. Further operation of the PT 
was accompanied by an increase in the concentration of hydrocarbon gases: C,H,-ethylene, CH,-methane, 
C,H,-ethane, as well as CO-carbon monoxide and CO ,-carbon dioxide. The center of the defect was 
presumably located in the lower part of the yoke of the magnetic circuit, where access was excluded without 
completely disassembling the structure of the active part (that is, performing an expensive overhaul). During 
the operation, it was decided to continue the operation of the PT under load with a frequent sampling of oil 
for DGA and its periodic degassing. In this condition, the PT was operated on until March 2013. During this 
time, the development of the defect has passed into a critical phase with the threat of thermal damage to the 
cellulose insulation. As a result, the DGA retrospective comprised 146 protocols, of which 57 (by criterion 
(3)) belonged to the state class S,, and 89-to the class S2. Figure 1 shows the relative frequency D histograms 
for the selected dichotomy of state classes. The area of intersection of the histograms in the classes S, and Sz 
determines the total error in recognizing the state of the PT, the estimate of which is € = 3.42%. The 
numerous characteristics of the distributions for the class dichotomy are given in Table 1. The Dmax 
calculations using models (5) and (6) showed fairly close results of 0.7351 and 0.7347, respectively. 


= c 
ee E Class S: W Class S2 


rA 


O 0.28 0.56 0.84 1.12 1.4 1.68 1.96 2.24 2.52 2.8 3.08 3.36 3.64 3.92 4.2 D o.e 


Figure 1. Histograms of relative frequencies RV D for the dichotomy of classes PT 


With the value of the computational constant in expression (6) k = 2 the following error estimates 
are determined ¢, = 2.08% and £, = 1.34%, which is quite acceptable from the point of view of real 
operational practice. This is quite acceptable from the point of view of real operational practice. Statistical 
analysis of two-parameter distributions of RV D in each of the classes of PT states with verification of the 
initial hypothesis of belonging to one of the above laws was carried out using the Kolmogorov-Smirnov 
criterion [23]-[25]. Calculations with different confidence levels have confirmed the validity of the proposed 
initial hypothesis. The results of testing hypotheses about the statistical law of distribution of RV is shown in 
Figure 2(a) normal distribution in class S,, Figure 2(b) lognormal distribution in class Sj, Figure 2(c) 
lognormal distribution in class S4, and Figure 2(d) gamma distribution in class Sz. The results of the study 
found that with a confidence probability of 0.95, the studied distribution of the random attribute D in the 
class S, satisfies the normal law in the class S, it satisfies the log-normal law. When studying the influence 
of the amount and composition of diagnostic gases (n) dissolved in PT oil on the reliability of the Bayesian 
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classifier model, the following considerations were taken into account: 

— Reduction (to the standard set of gases at n=7) the number of monitored gases will, as expected, lead to a 
decrease in the reliability of the statistical model for recognizing PT states due to the loss of useful 
information; 

— Arbitrariness in the choice of the composition of controlled gases should be limited and based on the 
physicochemical interpretation of defect formation processes in PT. 


Table 1. Numerous characteristics of the distributions RV D for each of the classes of states 


Class States PT The values of the numerical characteristics of the distribution D 
Sı "normal" M,=0.4273 0,=0.1537 
S "deviation from the normal" M,=2.0622 d2=1.0689 
Variable: Var2, Distribution: Normal Variable: Var2, Distribution: Log-normal 


Chi-Square test = 3.30677, df = 2 (adjusted) , p = 0.19140 Chi-Square test = 4.09493, df = 3 (adjusted) , p = 0.25139 


No. of observations 
No. of observations 


Category (upper limits) Category (upper limits) 
(a) (b) 
Variable: Var2, Distribution: Log-normal Variable: Var2, Distribution: Gamma 
Chi-Square test = 3.69493, df = 2 (adjusted) , p = 0.15764 Chi-Square test = 4.06313, df = 3 (adjusted) , p = 0.25473 
16 F 7 T T | 35 
30 
2} 
a S 2} 
2 3 
$ $ 
6 3 15 
2 2 
10} 
0 1 2 3 4 5 6 7 8 9 10 
Category (upper limits) Category (upper limits) 
(c) (d) 


Figure 2. The results of testing hypotheses about the statistical law of distribution of RV: (a) normal 
distribution in class S,, (b) lognormal distribution in class Sz, (c) lognormal distribution in class $4, and 
(d) gamma distribution in class S3 


Taking into account the above, the reduction in the number of gases to n=5 in the traditional 
composition, except for carbon oxide and carbon dioxide, is due to the presence of CO, CO, and in the PT 
oil, regardless of the duration of operation and the presence of internal faults and is associated with the 
chemical composition of the organic dielectric. The reduction in the number of gases to n=3 in the 
composition of CH,, C2H3, C2H4 is due to their status of a "key gas" in poorly classified situations of 
developing thermal defects in the ranges of low and medium temperatures. In addition, the indicated 
composition of gases determines the scheme for interpreting defects in PT using the Duval triangle method 
[26]-[28]. The analysis of the results showed that the reduction in the number of parameters involved in the 
formation of the Bayesian classification model causes a characteristic change in the statistical moments of the 
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RV D in the classes of states S4, S and. So, for example, the mathematical expectation M, tends to increase 
from 0.4273 at n=7, up to 0.4429 at n=3. Similarly, increases the standard deviation g, increases from 0.1537 
to 0.2002 and, as a consequence, the value of Dmax» calculated by expression (6), Figure 3(a) shows decrease 
Dmax by increasing the number of parameters n and Figure 3(b) shows that the total classification error 
increases with increasing number of parameters n. For the automation of statistical calculations and the 
subsequent diagnostic assessment of the PT state according to the criteria (7), an algorithmic and software 
implementation is developed as shown in Figure 4. 


€ Ly 
Dmax = 0.9334exp(-0.034 n) 35] €= 4.1446In(n)- 4.3763 — 
wee i R?= 0.9972 ` R?= 0.9539 i A 
0.82 4 ea A “ 
0.8 N 25 n 
NS 5] Py 
0.78 4 x Pa 
si wN 15 4 7 
a 7 id A 

0.74 4 ~ x os 1.7 
0.72 + ~~ — i 0 K T n 

3 5 7 3 5 7 

(a) (b) 


Figure 3. Dependences of the characteristics of the classifier model on the number of controlled gases: 
(a) boundary between classes of PT states and (b) total classification error according to DGA 


Single \ Group 
Object of Diagnosis (OD) 


Diagnostic Methods 


ECES 


Formation of samples of controlled parameters 


Uniformity check data filtering 


Formation of dichotomy of classes of states OD by (2) 


Starting classification using criterion (3) 


Statistical analysis of data in each class: 


1) Calculation of numerical characteristics; 
2) Checking the law of data distribution in each class. 


The formation of the border of the state classes, Dy by condition min £% (6) 


Recognition of the current operational state of OD based on 
the decision-making rule (7) 


Conclusion issue 


Figure 4. The algorithm of statistical calculations and state estimation of object of diagnosis: dissolved gas 
analysis (DGA); moisture analysis (MA); partial discharge monitoring (PDM); vibration monitoring (VM) 
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The total classification error ¢,% with a decrease in the number of monitored gases n tends to 
decrease as shown in Figure 3. In part, this is explained by the difference in the sample sizes of RV D in 
classes S4, Sz and with variation n=3, 5, 7 as a result of preliminary division into classes according to the 
criterion (3) of the initial set of DGA PT protocols. The dependences (8) obtained by approximating the 
experimental points adequately reflect the adaptive properties of the statistical model of the classifier with 
variations in the number and composition of the controlled parameters (diagnostic gases) participating in its 
formation. 


Dmmaxj 0.9334 x exp(—0.034 x n); £; 4.1446 x 1n(n) — 4.3763 (8) 


At the stage of calculating the RV D according to formula (1), the dependencies (8) can play the role of 
tuning functions that determine the important characteristics of the reliability of the model from the set of 
input controlled parameters. 


4. CONCLUSION 

The relevance of increasing the reliability of diagnostic assessments of electrical equipment, based 
on which decisions are made to extend its operation or withdraw it for repair, is extremely high since it 
determines the reliability of the functioning of electrical equipment and the system of electrical power as a 
whole. The use of the Bayesian classifier as a tool for increasing the methodological reliability of diagnostic 
assessments, despite some limitations, opens up extraordinary opportunities in the formation of adaptive 
decision rules that minimize the total recognition error. Models and a method for determining the classifier 
are proposed. The studies of the influence on the reliability of diagnostic assessments according to the 
classifier model of the quantity and composition of the controlled parameters involved in its formation have 
been carried out. Dependencies (8) were obtained, which can be used as functions for setting the reliability 
characteristics of the classifier model. One of the examples of the practical application of the developed 
statistical method is considered, its algorithmic implementation is presented, which provides support for 
computational processes. 
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